Results 1 - 20 of 992
1.
Nucleic Acids Res ; 52(D1): D963-D971, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37953384

ABSTRACT

Polygenic score (PGS) is an important tool for the genetic prediction of complex traits. However, there are currently no resources providing comprehensive PGSs computed from published summary statistics, and it is difficult to implement and run different PGS methods due to the complexity of their pipelines and parameter settings. To address these issues, we introduce a new resource called PGS-Depot containing the most comprehensive set of publicly available disease-related GWAS summary statistics. PGS-Depot includes 5585 high quality summary statistics (1933 quantitative and 3652 binary trait statistics) curated from 1564 traits in European and East Asian populations. A standardized best-practice pipeline is used to implement 11 summary statistics-based PGS methods, each with different model assumptions and estimation procedures. The prediction performance of each method can be compared for both in- and cross-ancestry populations, and users can also submit their own summary statistics to obtain custom PGS with the available methods. Other features include searching for PGSs by trait name, publication, cohort information, population, or the MeSH ontology tree and searching for trait descriptions with the experimental factor ontology (EFO). All scores, SNP effect sizes and summary statistics can be downloaded via FTP. PGS-Depot is freely available at http://www.pgsdepot.net.
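As a minimal illustration of what a polygenic score is, the sketch below computes a score as a weighted sum of allele dosages. The SNP identifiers, effect sizes, and dosages are hypothetical, and this is not the PGS-Depot pipeline or any of its 11 methods, which differ in how the per-SNP weights are estimated.

```python
def polygenic_score(dosages, effect_sizes):
    """Weighted sum of allele dosages over SNPs present in both inputs.

    dosages: dict mapping SNP id -> effect-allele count (0, 1, or 2)
    effect_sizes: dict mapping SNP id -> per-allele weight
    """
    shared = dosages.keys() & effect_sizes.keys()
    return sum(dosages[snp] * effect_sizes[snp] for snp in shared)


# Hypothetical weights and genotype, for illustration only
effects = {"rs1": 0.5, "rs2": -0.25, "rs3": 0.125}
person = {"rs1": 2, "rs2": 1, "rs3": 0}
score = polygenic_score(person, effects)
print(score)  # 2*0.5 + 1*(-0.25) + 0*0.125 = 0.75
```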


Subjects
Biostatistics , Multifactorial Inheritance , Genome-Wide Association Study , Multifactorial Inheritance/genetics , Phenotype , Polymorphism, Single Nucleotide , Biostatistics/methods
3.
Int J Biostat ; 19(2): 309-331, 2023 11 01.
Article in English | MEDLINE | ID: mdl-37192544

ABSTRACT

In this work, we examine recently developed methods for Bayesian inference of optimal dynamic treatment regimes (DTRs). DTRs are a set of treatment decision rules aimed at tailoring patient care to patient-specific characteristics, thereby falling within the realm of precision medicine. In this field, researchers seek to tailor therapy with the intention of improving health outcomes; therefore, they are most interested in identifying optimal DTRs. Recent work has developed Bayesian methods for identifying optimal DTRs in a family indexed by ψ via Bayesian dynamic marginal structural models (MSMs) (Rodriguez Duque D, Stephens DA, Moodie EEM, Klein MB. Semiparametric Bayesian inference for dynamic treatment regimes via dynamic regime marginal structural models. Biostatistics; 2022. (In Press)); we review the proposed estimation procedure and illustrate its use via the new BayesDTR R package. Although methods in Rodriguez Duque D, Stephens DA, Moodie EEM, Klein MB. (Semiparametric Bayesian inference for dynamic treatment regimes via dynamic regime marginal structural models. Biostatistics; 2022. (In Press)) can estimate optimal DTRs well, they may lead to biased estimators when the model for the expected outcome if everyone in a population were to follow a given treatment strategy, known as a value function, is misspecified or when a grid search for the optimum is employed. We describe recent work that uses a Gaussian process (GP) prior on the value function as a means to robustly identify optimal DTRs (Rodriguez Duque D, Stephens DA, Moodie EEM. Estimation of optimal dynamic treatment regimes using Gaussian processes; 2022. Available from: https://doi.org/10.48550/arXiv.2105.12259). We demonstrate how a GP approach may be implemented with the BayesDTR package and contrast it with other value-search approaches to identifying optimal DTRs. 
We use data from an HIV therapeutic trial in order to illustrate a standard analysis with these methods, using both the original observed trial data and an additional simulated component to showcase a longitudinal (two-stage DTR) analysis.


Subjects
Models, Statistical , Precision Medicine , Humans , Bayes Theorem , Precision Medicine/methods , Biostatistics/methods
4.
Rev Esp Enferm Dig ; 115(4): 157-159, 2023 04.
Article in English | MEDLINE | ID: mdl-36779473

ABSTRACT

Statistical tests are the foundation on which data analysis for all types of clinical, basic and epidemiological research relies. Clinicians need to understand these tests and the basics of biostatistics to correctly relay the data and results from their research and to understand the results of other scientific publications. The first article in this three-part editorial series aims to present, in a clear way, some essential concepts of biostatistics that can assist the gastroenterologist in understanding scientific research, for an evidence-based clinical practice.


Subjects
Gastroenterologists , Humans , Biostatistics/methods
5.
Lifetime Data Anal ; 29(3): 508-536, 2023 07.
Article in English | MEDLINE | ID: mdl-36624222

ABSTRACT

The progression of disease for an individual can be described mathematically as a stochastic process. The individual experiences a failure event when the disease path first reaches or crosses a critical disease level. This happening defines a failure event and a first hitting time or time-to-event, both of which are important in medical contexts. When the context involves explanatory variables then there is usually an interest in incorporating regression structures into the analysis and the methodology known as threshold regression comes into play. To date, most applications of threshold regression have been based on parametric families of stochastic processes. This paper presents a semiparametric form of threshold regression that requires the stochastic process to have only one key property, namely, stationary independent increments. As this property is frequently encountered in real applications, this model has potential for use in many fields. The mathematical underpinnings of this semiparametric approach for estimation and prediction are described. The basic data element required by the model is a pair of readings representing the observed change in time and the observed change in disease level, arising from either a failure event or survival of the individual to the end of the data record. An extension is presented for applications where the underlying disease process is unobservable but component covariate processes are available to construct a surrogate disease process. Threshold regression, used in combination with a data technique called Markov decomposition, allows the methods to handle longitudinal time-to-event data by uncoupling a longitudinal record into a sequence of single records. Computational aspects of the methods are straightforward. An array of simulation experiments that verify computational feasibility and statistical inference are reported in an online supplement. 
Case applications based on longitudinal observational data from The Osteoarthritis Initiative (OAI) study are presented to demonstrate the methodology and its practical use.
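The uncoupling of a longitudinal record into single records can be sketched roughly as follows. This is an illustrative reading of the abstract, not the authors' implementation: the function name, the convention that failure means the level rising to or above a threshold, and the example readings are all assumptions.

```python
def markov_decompose(times, levels, threshold):
    """Split one longitudinal record into single-interval records.

    Each output record is (change in time, change in level, failed),
    matching the "pair of readings" data element plus a failure flag.
    Assumption: failure means the level reaches or exceeds `threshold`;
    the record ends at the first interval in which that happens.
    """
    records = []
    for i in range(1, len(times)):
        dt = times[i] - times[i - 1]
        dy = levels[i] - levels[i - 1]
        failed = levels[i] >= threshold
        records.append((dt, dy, failed))
        if failed:
            break
    return records


# Hypothetical visit times and disease levels for one individual
visits = markov_decompose([0, 1, 3, 4], [1.0, 2.0, 2.5, 5.0], threshold=4.0)
print(visits)  # [(1, 1.0, False), (2, 0.5, False), (1, 2.5, True)]
```

Each tuple can then be treated as an independent observation only under the stationary independent increments property the abstract names.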


Subjects
Biostatistics , Models, Statistical , Humans , Stochastic Processes , Computer Simulation , Time Factors , Biostatistics/methods
6.
Rev. esp. enferm. dig ; 115(4): 157-159, 2023. ilus
Article in English | IBECS | ID: ibc-218581
8.
Rev. cuba. inform. méd ; 14(2): e529, jul.-dic. 2022. tab, graf
Article in Spanish | LILACS, CUMED | ID: biblio-1408550

ABSTRACT

The use of mobile devices is essential in modern life because of the advantages they provide, offering new possibilities and delivering established services virtually. The fact that students in Cuba have more mobile phones than computers motivated the development of this application. The objective of this article is to describe the application Cálculos estadísticos y tasas en salud (Calc. Tasas, version 1.7), built to perform calculations in a Biostatistics course. It covers a large part of the content of this subject in undergraduate teaching at medical universities, as well as other content of interest in this field. It also incorporates a database with demographic and health information on Cuba and its provinces for the period 2013-2020. As a result, technological independence was achieved by no longer using foreign programs, and portability improved, since the application works on both mobile phones and computers through an Android emulator.


Subjects
Humans , Male , Female , Mathematical Computing , Medical Informatics Applications , Programming Languages , Biostatistics/methods , Mobile Applications , Cuba
9.
PLoS One ; 17(7): e0272007, 2022.
Article in English | MEDLINE | ID: mdl-35867721

ABSTRACT

Interval estimation with accurate coverage for risk difference (RD) in a correlated 2 × 2 table with structural zero is a fundamental and important problem in biostatistics. The score test-based and Bayesian tail-based confidence intervals (CIs) have good coverage performance among the existing methods. However, as approximation approaches, they have coverage probabilities lower than the nominal confidence level for finite and moderate sample sizes. In this paper, we propose three new CIs for RD based on the fiducial, inferential model (IM) and modified IM (MIM) methods. The IM interval is proven to be valid. Moreover, simulation studies show that the CIs of fiducial and MIM methods can guarantee the preset coverage rate even for small sample sizes. More importantly, in terms of coverage probability and expected length, the MIM interval outperforms other intervals. Finally, a real example illustrates the application of the proposed methods.


Subjects
Biostatistics , Models, Statistical , Bayes Theorem , Biometry , Biostatistics/methods , Confidence Intervals , Sample Size
10.
Parasit Vectors ; 15(1): 35, 2022 Jan 24.
Article in English | MEDLINE | ID: mdl-35073988

ABSTRACT

Dose-response relationships reflect the effects of a substance on organisms, and are widely used in broad research areas, from medicine and physiology, to vector control and pest management in agronomy. Furthermore, reporting on the response of organisms to stressors is an essential component of many public policies (e.g. public health, environment), and assessment of xenobiotic responses is an integral part of World Health Organization recommendations. Building upon an R script that we previously made available, and considering its popularity, we have now developed a software package in the R environment, BioRssay, to efficiently analyze dose-response relationships. It has more user-friendly functions and more flexibility, and proposes an easy interpretation of the results. The functions in the BioRssay package are built on robust statistical analyses to compare the dose/exposure-response of various bioassays and effectively visualize them in probit-graphs.


Subjects
Biological Assay/statistics & numerical data , Biostatistics , Software , Animals , Biostatistics/instrumentation , Biostatistics/methods , Dose-Response Relationship, Drug , Humans , Lethal Dose 50 , Public Health/statistics & numerical data
14.
Brief Bioinform ; 22(5)2021 09 02.
Article in English | MEDLINE | ID: mdl-33822893

ABSTRACT

A major task in the analysis of microbiome data is to identify microbes associated with differing biological conditions. Before conducting analysis, raw data must first be adjusted so that counts from different samples are comparable. A typical approach is to estimate normalization factors by which all counts in a sample are multiplied or divided. However, the inherent variation associated with estimation of normalization factors is often not accounted for in subsequent analysis, leading to a loss of precision. Rank normalization is a nonparametric alternative to the estimation of normalization factors in which each count for a microbial feature is replaced by its intrasample rank. Although rank normalization has been successfully applied to microarray analysis in the past, it has yet to be explored for microbiome data, which are characterized by high frequencies of 0s, strongly correlated features and compositionality. We propose to use rank normalization as an alternative to the estimation of normalization factors and examine its performance when paired with a two-sample t-test. On a rigorous 3rd-party benchmarking simulation, it is shown to offer strong control over the false discovery rate, and at sample sizes greater than 50 per treatment group, to offer an improvement in performance over commonly used normalization factors paired with t-tests, Wilcoxon rank-sum tests and methodologies implemented by R packages. On two real datasets, it yielded valid and reproducible results that were strongly in agreement with the original findings and the existing literature, further demonstrating its robustness and future potential. Availability: The data underlying this article are available online along with R code and supplementary materials at https://github.com/matthewlouisdavisBioStat/Rank-Normalization-Empowers-a-T-Test.
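The idea of replacing each count by its intrasample rank and then comparing groups can be sketched in a few lines. This is a minimal sketch, not the authors' R code: the tie handling (average ranks), the choice of the Welch form of the t statistic, and the toy counts are assumptions made for illustration.

```python
from statistics import mean, variance


def intrasample_ranks(counts):
    """Replace each count in one sample by its rank within that sample.

    Ties (common with zero counts) receive the average of their ranks.
    """
    order = sorted(range(len(counts)), key=lambda i: counts[i])
    ranks = [0.0] * len(counts)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and counts[order[j + 1]] == counts[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def welch_t(a, b):
    """Two-sample t statistic with unequal variances (Welch form, assumed)."""
    return (mean(a) - mean(b)) / (variance(a) / len(a) + variance(b) / len(b)) ** 0.5


# Toy data: each inner list is one sample's counts across four features
group_a = [[0, 0, 5, 3], [1, 0, 7, 2], [0, 2, 9, 4]]
group_b = [[6, 0, 1, 2], [2, 1, 0, 3], [5, 0, 2, 1]]

feature = 0
ranks_a = [intrasample_ranks(sample)[feature] for sample in group_a]
ranks_b = [intrasample_ranks(sample)[feature] for sample in group_b]
t_stat = welch_t(ranks_a, ranks_b)
```

Because ranks are computed within each sample, no normalization factor is estimated, which is the property the abstract highlights.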


Subjects
Bacteria/genetics , Bacterial Infections/diagnosis , Biostatistics/methods , Colorectal Neoplasms/microbiology , Crohn Disease/microbiology , Gastrointestinal Microbiome/genetics , Metagenome , Bacterial Infections/microbiology , Benchmarking , Case-Control Studies , Child , Cohort Studies , Computer Simulation , Female , Humans , Male , Mathematical Computing , Metagenomics/methods , RNA, Ribosomal, 16S/genetics , Reproducibility of Results , Sensitivity and Specificity , Statistics, Nonparametric
15.
Hist Philos Life Sci ; 43(1): 13, 2021 Feb 02.
Article in English | MEDLINE | ID: mdl-33528820

ABSTRACT

From 1950 to 1952, statisticians W.G. Cochran, C.F. Mosteller, and J.W. Tukey reviewed A.C. Kinsey and colleagues' methodology. Neither the history-and-philosophy of science literature nor contemporary theories of interdisciplinarity seem to offer a conceptual model that fits this forced interaction, which was characterized by significant power asymmetries and disagreements on multiple levels. The statisticians initially attempted to exclude all non-technical matters from their evaluation, but their political and personal investments interfered with this agenda. In the face of McCarthy's witch hunts, negotiations with Kinsey and his funding institutions became integral to the review group's work. This paper analyzes the heavy burden of emotional and affective labor in this collaboration, the conflicts caused by competing visions of objectivity, and the uses of statistical knowledge to gain and sustain authority. Kinsey's refusal to adopt the recommended probability sample damaged his already precarious position even further and marked him as a biased researcher who put his personal agenda above methodological rigor. Kinsey's uncooperative demeanor can be explained by distrust resulting from numerous adverse reactions to his work and by fear of having his sexuality exposed. This case study illustrates that the very concept of valid numbers can become an arena for power struggles and that quantification alone does not guarantee productive exchanges across disciplines. It calls for a deeper conceptual analysis of the prerequisites for successful scientific collaborations.


Subjects
Biomedical Research/history , Biostatistics/history , Psychiatry/history , Sexual Behavior , Biostatistics/methods , History, 20th Century , Humans
16.
PLoS One ; 16(1): e0245499, 2021.
Article in English | MEDLINE | ID: mdl-33449937

ABSTRACT

BACKGROUND: The syndrome of inappropriate antidiuretic hormone secretion (SIADH) is one of the most common causes of hypotonic hyponatremia. In our previous work, we have derived a novel model (Voets equation) that can be used by clinicians to predict the effect of crystalloid intravenous fluid therapy on the plasma sodium concentration in SIADH. METHODS: In this retrospective chart review, the predictive accuracy of the Voets equation and the Adrogue-Madias equation for the plasma sodium response to crystalloid infusate was compared for fifteen plasma sodium response measurements (n = 15) in twelve SIADH patients. The medical records of these patients were accessed anonymously and none of the authors were their treating physicians. The Pearson correlation coefficient r and corresponding p-value were calculated for the predictions by the Voets model compared to the measured plasma sodium response and for the predictions by the Adrogue-Madias model compared to the measured plasma sodium response. RESULTS AND CONCLUSION: The presented results show that the Voets model (r = 0.94, p < 0.001) predicted the aforementioned plasma sodium response significantly more accurately than the Adrogue-Madias model (r = 0.49, p = 0.07) in SIADH patients and could therefore be a clinically useful addition to the existing prediction models.


Subjects
Biostatistics/methods , Fluid Therapy , Inappropriate ADH Syndrome/blood , Inappropriate ADH Syndrome/therapy , Sodium/blood , Administration, Intravenous , Aged , Aged, 80 and over , Female , Humans , Male , Retrospective Studies
17.
Methods Mol Biol ; 2194: 77-105, 2021.
Article in English | MEDLINE | ID: mdl-32926363

ABSTRACT

Survival analysis is tremendously powerful, and is a popular methodology for analyzing time-to-event models in bioinformatics. Furthermore, several of its extensions can simultaneously perform variable selection in conjunction with model estimation. While this flexibility is extremely desirable, under certain scenarios, binary class variable selection and classification methods might provide more reliable risk estimates. Synthetic simulations and real data case studies suggest that when (1) randomly censored points comprise only a small portion of data, (2) biological markers are weak, (3) it is desired to compute risk across predetermined time intervals, and (4) the assumptions of the competing time-to-event models are violated, binary class models tend to perform better. In practice, it might be prudent to test both model families to guarantee adequate analysis. Here we describe the pipeline of binary class feature selection and classification for time-to-event risk assessment.
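One way to recast time-to-event records as a binary classification problem at a predetermined horizon is sketched below. The labeling rule, including dropping subjects censored before the horizon, is an assumption made for illustration and is not taken from the chapter; the records are hypothetical.

```python
def binary_labels(records, horizon):
    """Convert (time, event_observed) pairs to binary labels at `horizon`.

    1    -> event observed at or before the horizon
    0    -> subject known to be event-free past the horizon
    None -> censored at or before the horizon, so the status is unknown
            (assumed here to be excluded from training)
    """
    labels = []
    for time, event_observed in records:
        if event_observed and time <= horizon:
            labels.append(1)
        elif time > horizon:
            labels.append(0)
        else:
            labels.append(None)
    return labels


# Hypothetical follow-up times in years; True means the event was observed
records = [(2.0, True), (6.0, True), (3.0, False), (8.0, False)]
labels = binary_labels(records, horizon=5.0)
print(labels)  # [1, 0, None, 0]
```

The `None` cases show why the abstract's first condition matters: the more records are censored before the horizon, the more data a binary formulation has to discard or impute.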


Subjects
Biostatistics/methods , Computational Biology/methods , Neoplasms/mortality , Algorithms , Analysis of Variance , Computer Simulation , Data Interpretation, Statistical , Discriminant Analysis , Humans , Linear Models , Prognosis , Risk Assessment/methods , Support Vector Machine , Survival Analysis
18.
Methods Mol Biol ; 2194: 187-221, 2021.
Article in English | MEDLINE | ID: mdl-32926368

ABSTRACT

Highly collaborative scientists are often called on to extend their expertise to different types of projects and to expand the scope and scale of projects well beyond their previous experience. For a large-scale project involving "big data" to be successful, several different aspects of the research plan need to be developed and tested, which include but are not limited to the experimental design, sample collection, sample preparation, metadata recording, technical capability, data acquisition, approaches for data analysis, methods for integration of different data types, recruitment of additional expertise as needed to guide the project, and strategies for clear communication throughout the project. To capture this process, we describe an example project in proteogenomics that built on our collective expertise and experience. Key steps included definition of hypotheses, identification of an appropriate clinical cohort, pilot projects to assess feasibility, refinement of experimental designs, and extensive discussions involving the research team throughout the process. The goal of this chapter is to provide the reader with a set of guidelines to support development of other large-scale multiomics projects.


Subjects
Biostatistics/methods , Interdisciplinary Research/methods , Proteogenomics/methods , Big Data , Cohort Studies , Gene Expression , Genomics/methods , Humans , Pilot Projects , Proteomics/methods , Research Design
19.
Methods Mol Biol ; 2231: 121-134, 2021.
Article in English | MEDLINE | ID: mdl-33289890

ABSTRACT

Sequence alignment is at the heart of DNA and protein sequence analysis. For the data volumes that are nowadays produced by massively parallel sequencing technologies, however, pairwise and multiple alignment methods are often too slow. Therefore, fast alignment-free approaches to sequence comparison have become popular in recent years. Most of these approaches are based on word frequencies, for words of a fixed length, or on word-matching statistics. Other approaches use the length of maximal word matches. While these methods are very fast, most of them rely on ad hoc measures of sequence similarity or dissimilarity that are hard to interpret. In this chapter, I describe a number of alignment-free methods that we developed in recent years. Our approaches are based on spaced-word matches ("SpaM"), i.e. on inexact word matches that are allowed to contain mismatches at certain pre-defined positions. Unlike most previous alignment-free approaches, our approaches are able to accurately estimate phylogenetic distances between DNA or protein sequences using a stochastic model of molecular evolution.
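The notion of a spaced-word match can be illustrated with a short sketch. It is not the SpaM software: the function names and the binary pattern notation (`1` for a position that must match, `0` for a position where mismatches are allowed) are assumptions, and the distance estimation step described in the abstract is not shown.

```python
from collections import defaultdict


def spaced_words(seq, pattern):
    """List (start, word) pairs, keeping only characters at '1' positions."""
    match_pos = [i for i, c in enumerate(pattern) if c == "1"]
    return [
        (start, "".join(seq[start + i] for i in match_pos))
        for start in range(len(seq) - len(pattern) + 1)
    ]


def spaced_word_matches(seq_a, seq_b, pattern):
    """Return position pairs (i, j) whose spaced words agree at all '1' positions."""
    index = defaultdict(list)
    for pos, word in spaced_words(seq_b, pattern):
        index[word].append(pos)
    return [
        (i, j)
        for i, word in spaced_words(seq_a, pattern)
        for j in index.get(word, [])
    ]


# Toy sequences; pattern "101" ignores the middle position of each 3-mer
matches = spaced_word_matches("ACGTAC", "ATGTCC", "101")
print(matches)  # [(0, 0), (3, 3)]
```

In this toy case both matches differ at the ignored middle position, which is the kind of information a distance estimate could draw on.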


Subjects
Biostatistics/methods , Genomics/methods , Sequence Analysis, DNA/methods , Sequence Analysis, Protein/methods , Software , Algorithms , Evolution, Molecular , Phylogeny , Sequence Alignment
20.
PLoS One ; 15(12): e0241427, 2020.
Article in English | MEDLINE | ID: mdl-33347441

ABSTRACT

In the last decades, statistical methodology has developed rapidly, in particular in the field of regression modeling. Multivariable regression models are applied in almost all medical research projects. Therefore, the potential impact of statistical misconceptions within this field can be enormous. Indeed, the current theoretical statistical knowledge is not always adequately transferred to the current practice in medical statistics. Some medical journals have identified this problem and published isolated statistical articles and even whole series thereof. In this systematic review, we aim to assess the current level of education on regression modeling that is provided to medical researchers via series of statistical articles published in medical journals. The present manuscript is a protocol for a systematic review that aims to assess which aspects of regression modeling are covered by statistical series published in medical journals that intend to train and guide applied medical researchers with limited statistical knowledge. Statistical paper series cannot easily be summarized and identified by common keywords in an electronic search engine like Scopus. We therefore identified series by a systematic request to statistical experts who are part of or related to the STRATOS Initiative (STRengthening Analytical Thinking for Observational Studies). Within each identified article, two raters will independently check the content of the articles with respect to a predefined list of key aspects related to regression modeling. The content analysis of the topic-relevant articles will be performed using a predefined report form to assess the content as objectively as possible. Any disputes will be resolved by a third reviewer. Summary analyses will identify potential methodological gaps and misconceptions that may have an important impact on the quality of analyses in medical research. 
This review will thus provide a basis for future guidance papers and tutorials in the field of regression modeling which will enable medical researchers 1) to interpret publications in a correct way, 2) to perform basic statistical analyses in a correct way and 3) to identify situations when the help of a statistical expert is required.


Subjects
Biomedical Research/statistics & numerical data , Models, Statistical , Regression Analysis , Bias , Biomedical Research/education , Biostatistics/methods , Data Collection , Data Management , Data Science/education , Data Science/statistics & numerical data , Humans , Observational Studies as Topic , Periodicals as Topic